
    Automated Unit Testing of Evolving Software

    As software programs evolve, developers need to ensure that new changes do not affect the originally intended functionality of the program. To increase their confidence, developers commonly write unit tests along with the program and execute them after a change is made. However, manually writing these unit tests is difficult and time-consuming, and as their number increases, so does the cost of executing and maintaining them. Automated test generation techniques have been proposed in the literature to assist developers in the endeavour of writing these tests. However, it remains an open question how well these tools can help with fault finding in practice, and maintaining automatically generated tests may require extra effort compared to human-written ones. This thesis evaluates the effectiveness of a number of existing automatic unit test generation techniques at detecting real faults, and explores how these techniques can be improved. In particular, we present a novel multi-objective search-based approach for generating tests that reveal changes across two versions of a program. We then investigate whether these tests can be used such that no maintenance effort is necessary. Our results show that, overall, state-of-the-art test generation tools can indeed be effective at detecting real faults: collectively, the tools revealed more than half of the bugs we studied. We also show that our proposed alternative technique, which is better suited to the problem of revealing changes, detects more faults and does so more frequently. However, we also find that for a majority of object-oriented programs, even a random search can achieve good results. Finally, we show that such change-revealing tests can be generated on demand in practice, without requiring them to be maintained over time.
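
    As a rough illustration of what a multi-objective search for change-revealing tests might optimise, the sketch below scores a candidate test against two objectives: reaching the code that changed between versions, and provoking a behavioural difference between the old and new version. The interface, class names, and scoring are illustrative assumptions, not the concrete objectives used in the thesis.

        import java.util.List;
        import java.util.Objects;

        /**
         * Hypothetical two-objective fitness for change-revealing tests
         * (names and scoring are assumptions, not the thesis's actual design).
         * Both objectives are to be minimised by the search.
         */
        public class ChangeRevealingFitness {

            /** Abstraction over executing one candidate test on one program version. */
            public interface VersionRunner {
                Object run(List<String> testCalls);                 // observable output of the test
                boolean reachedChangedCode(List<String> testCalls); // did the test hit modified code?
            }

            private final VersionRunner oldVersion;
            private final VersionRunner newVersion;

            public ChangeRevealingFitness(VersionRunner oldVersion, VersionRunner newVersion) {
                this.oldVersion = oldVersion;
                this.newVersion = newVersion;
            }

            /** Objective 1: 0.0 if the changed code is executed, 1.0 otherwise. */
            public double changedCodeDistance(List<String> testCalls) {
                return newVersion.reachedChangedCode(testCalls) ? 0.0 : 1.0;
            }

            /** Objective 2: 0.0 if the two versions disagree on the observed output, 1.0 if they agree. */
            public double behaviouralDifference(List<String> testCalls) {
                Object before = oldVersion.run(testCalls);
                Object after = newVersion.run(testCalls);
                return Objects.equals(before, after) ? 1.0 : 0.0;
            }
        }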

    Revisiting the Relationship Between Fault Detection, Test Adequacy Criteria, and Test Set Size

    The research community has long recognized a complex interrelationship between test set size, test adequacy criteria, and test effectiveness in terms of fault detection. However, there is substantial confusion about the role and importance of controlling for test set size when assessing and comparing test adequacy criteria. This paper makes the following contributions: (1) A review of contradictory analyses of the relationship between fault detection, test suite size, and test adequacy criteria. Specifically, this paper addresses the supposed contradiction of prior work and explains why test suite size is neither a confounding variable, as previously suggested, nor an independent variable that should be experimentally manipulated. (2) An explication and discussion of the experimental design and sampling strategies of prior work, together with a discussion of conceptual and statistical problems, and specific guidelines for future work. (3) A methodology for comparing test adequacy criteria on an equal basis, which accounts for test suite size by treating it as a covariate. (4) An empirical evaluation that compares the effectiveness of coverage-based and mutation-based testing to one another and to random testing. Additionally, this paper proposes probabilistic coupling, a methodology for approximating the representativeness of a set of test goals for a given set of real faults.
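
    To make contribution (3) concrete, one minimal sketch of "treating test suite size as a covariate" is a regression model in which the adequacy score and the suite size enter as separate terms; the paper's actual model may differ. With adequacy score $a(T)$ and size $|T|$ of a suite $T$:

        \[
          \Pr(\text{fault detected} \mid T) \;=\; \operatorname{logit}^{-1}\!\bigl(\beta_0 + \beta_1\, a(T) + \beta_2 \log |T|\bigr)
        \]

    Under such a model, criteria are compared through the coefficient $\beta_1$, while suite size influences the outcome only through the covariate term $\beta_2 \log |T|$ rather than being fixed or manipulated.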

    How Do Automatically Generated Unit Tests Influence Software Maintenance?

    Generating unit tests automatically saves time over writing tests manually and can lead to higher code coverage. However, automatically generated tests are usually not based on realistic scenarios, and are therefore generally considered to be less readable. This places a question mark over their practical value: every time a test fails, a developer has to decide whether the failure has revealed a regression fault in the program under test, or whether the test itself needs to be updated. Does the fact that automatically generated tests are harder to read outweigh the time savings gained by their automated generation, and render them more of a hindrance than a help for software maintenance? In order to answer this question, we performed an empirical study in which participants were presented with an automatically generated or manually written failing test, and were asked to identify and fix the cause of the failure. Our experiment and two replications resulted in a total of 150 data points based on 75 participants. Whilst maintenance activities take longer when working with automatically generated tests, we found developers to be equally effective with manually written and automatically generated tests. This has implications for how automated test generation is best used in practice, and it indicates a need for research into the generation of more realistic tests.

    Do Automatically Generated Unit Tests Find Real Faults? An Empirical Study of Effectiveness and Challenges (T)

    Rather than tediously writing unit tests manually, tools can be used to generate them automatically, sometimes even resulting in higher code coverage than manual testing. But how good are these tests at actually finding faults? To answer this question, we applied three state-of-the-art unit test generation tools for Java (Randoop, EvoSuite, and Agitar) to the 357 real faults in the Defects4J dataset and investigated how well the generated test suites perform at detecting these faults. Although the automatically generated test suites detected 55.7% of the faults overall, only 19.9% of all the individual test suites detected a fault. By studying the effectiveness and problems of the individual tools and the tests they generate, we derive insights to support the development of automated unit test generators that achieve a higher fault detection rate. These insights include 1) improving the obtained code coverage so that faulty statements are executed in the first place, 2) improving the propagation of faulty program states to an observable output, coupled with the generation of more sensitive assertions, and 3) improving the simulation of the execution environment to detect faults that depend on external factors such as date and time.
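
    Insight 2) is easiest to see on a small example. The sketch below (JUnit 4; the class under test, method, and expected value are invented for illustration, not taken from Defects4J) contrasts a generated test that merely executes the code with one whose assertion makes a behavioural change observable as a test failure.

        import static org.junit.Assert.assertEquals;

        import org.junit.Test;

        public class SensitiveAssertionExample {

            @Test
            public void exercisesCodeButCannotRevealTheFault() {
                DateParser parser = new DateParser();
                // The result is computed but never checked, so the test passes
                // on both the fixed and the faulty version of DateParser.
                parser.dayOfWeek("2015-11-09");
            }

            @Test
            public void assertsOnTheObservedOutput() {
                DateParser parser = new DateParser();
                // Asserting on the concrete result lets a wrong value propagate
                // to a visible test failure.
                assertEquals("Monday", parser.dayOfWeek("2015-11-09"));
            }

            /** Minimal stand-in for a class under test; purely illustrative. */
            static class DateParser {
                String dayOfWeek(String isoDate) {
                    return "Monday"; // real computation omitted
                }
            }
        }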